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YEAST PROTEIN EXPRESSION SECRETION SYSTEM 



FIELD OF INVENTION 

5 The present invention relates to novel expression systems for high level and efficient 
expression of insulin as prepro-polypeptides in yeast These pre-propolypeptides are 
efficiently secreted into the extracellular medium, fix)m where they may conveniently 
isolated, converted to native insulin and purified further. 
BACKGROUND TO THE BWENTION 

10 hisulin is a protein hannone that is secreted by the beta cells of the pancreas and plays a 
key role in the homeostasis of blood sugar. A key etiology of diabetes is the reduced or 
the complete cessation of insulin production and secretion by the beta cells, as well as 
resistance to its effects in the peripheral tissues. Thus treatment with insulin remams the 
most effective therapeutic strategy for diabetes, to ameliorate its symptoms as well as its 

15 associated complications. The early treatments with insulin involved the use of the 
hannone isolated from bovine or porcine sowces or from the pancreas of human 
cadavers. The preparation of such insulins, fi"om hxraian, bovme or porcine sources, is a 
highly cumbersome process, associated with difficult purification procedures, very low 
yields, and large amounts of irapxuities. Also, insulins fi'om non-hiunan sources may 

20 cause potentially allergic reactions. However, the tools of recombinant DNA technology 
address most of these difficulties by providing the means to obtain human insulin 
convenientiy, in very high yields and with very high degree of purity. 
The methods of recombinant DNA technology generally consist of isolating or 
synthesizing the gene that encodes a particular protein of interest and cloning the same 

25 into a suitable "heterologous host". The host is then cultured under suitable conditions to 
express the protein to very high levels. The protein may then be conveniently isolated and 
purified fi-om the culture medimn. But several factors effect the final yield and purity of 
even recombinantly expressed protein. These factors basically depend on the choice of 
the expression system, particularly the host culture, employed for the expression of the 

30 protein. The various strains of the bacterium E.coli by far remain •'the hosts of choice" 
for the heterologous expression of proteins. The reasons for this is the mpid generation 
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time ofE.coli and the consequent easy availability of a large biomass, the ease of genetic 
manipulation for generating a high expressing strain, the availability of a plethora of 
"expression vectors'' tailored to the needs of speciiBc RcoU strains for optimal expression 
etc. Yet E.coH expression systems are not without their disadvantages, the most important 

5 being the absence of "modification" systems that would otherwise chemically modify 
proteins of plant and animal origin and that may be crucial to protein function. In 
addition, quite ofl^ proteins are expressed as inactive aggregates ("inclusion bodies") 
inside the Exoli. Isolation of active protein fiom such inclusion bodies involves an 
additional step in the purification procedures, which in turn effects the final yield of the 

10 protein, as well the overall cost of isolation. These particular disadvantages may be 
overcome by expressing a protein in "higher" cellular hosts - either animal or plant cell 
culture systems. But the latter expression hosts are highly expensive, as well as yield 
much lower biomass as compared to E.coli strains. Yeast strains combine the advantages 
of the above distinct host systems. On the one hand they more closely mimic the native 

15 physiology of a plant/animal protein then does E,coli, on the other hand their ease of 
handling, ease of cultivation, much faster growth and much greater economy are typical 
of the advantages provided by E.coli. 

Several factors though, effect the expression of proteins in yeast as well. These factors 
include, but are not confined to: 
20 1) The choice of the gene regulatory sequences, such as promoters, that control the 
expression of an heterologous protein. The promoter sequences employed for 
controlling heterologous expression must typically be "strong," in that they effect 
very high expression of the protein, and suitably "controllable", whereby the 
expression may at first be efficiently repressed until an optimum biomass of the 
25 culture is reached and then quickly "switched on" to effect protein expression. 

2) Efficient secretion of the expressed heterologous protein. Secretion of the expressed 
protein ("extracellular" expression) is often preferred over intracellular expression as 
the latter would first entail breaking open the cell, thus disgorging the entire cellular 
contents, and then isolating the desired protein &om the cesspool of cellular material 
30 and debris. Yet efficient secretion of a protein in turn depends on several factors 
including: a) the choice of the signal sequences - peptide sequences which are usually 
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the N-tenninal regions of naturally secreted proteins, and which direct the protein into 
the cellular secretory pathway and, b) the specific components of the secretory 
pathway that interact with signal sequences and effect the secretion of the attached 
protein. 

5 Clearly there exists an enormous scope for the development of expression systems for 
improved large-scale production of proteins. The present invention provides such a 
system for the expression of insulin in yeast. 

The US patent H24S discloses a plasmid capable of replication and expression in 
E.coli of a human preproinsulin polypeptide, while US patent 4431740 describes a 

10 transfer vector carrying a cDNA of human pre-proinsulin and proinsulin. The US patent 
4916212 claims a DNA sequence encoding an insulin precursor of the formula B(l-29)- 
(Xn-Y)m -A(l-21) where m can be 0 or 1, n = 0 to 33 and X and Y represent amino acid 
sequences specifically defined in the patent, while US patents 5202415 and 5324641 
describe, respectively, insulin precursors and DNA sequences of B(l-29)- X1-X2-Y1- 

15 Y2-B(l-21), where Yl and Y2 each represent basic amino acid residues, US patent 
5962267 claims a precursor of the formula B-Z-A where B and A chains are respectively 
human insulin chains and Z is a specifically defined peptide. US patents 4914026 and 
5015575 teach the expression and secretion of human insulin chains in yeast, particularly 
Saccharomyces, under the control of a promoter fimctional in yeast and the secretion 

20 being directed by a yeast alpha-factor leader sequence fiised to the insulin precursor. Also 
US patent 6337194 describes the expression in yeast of a polypeptide of the general 
fonnula B-Z-A where B and A chains are insulin chains and Z is a peptide region with 
sequences that contain at least one proteolytic cleavage site. Z may fiirther comprise an 
afQnity polypeptide tag for the isolation and purification of the secreted product. ITie US 

25 patents 5389525, 5240838 and 5741672 describe the use of formaldehyde dehydrogenase 
and methanol oxidase respectively in the expression of proteins in the yeast strain 
Hansenula pofymorpha. On the other hand US patents 55414585, 5395922 and 5510249 
describe a polypeptide, consisting of signal and leader peptide sequences and a 
heterlogous polypeptide, that is efficiently processed prior to the secretion of the 

30 heterologous protein in yeast. Furthermore the US patents 5672487 and 5741674 describe 
a process for the recombinant production of protein in yeast, whereby the yeast strain is 
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transformed with an expression cassette consisting of a leader, adapter and a processing 
signal preceding the heterelogous polypeptide. The patent specifically describes the use 
of an adapter polypeptide having an alpha-helical structure. 

The present invention describes the expression of insulin, particularly human insulin, B 
5 and A chains as a fiision protein, fiised to signal peptide sequences, under the control of 
alcohol inducible promoters, such that the fusion polypeptide is very efiBciently expressed 
and secreted from yeasts. 
SUMMERY OF THE INVENTION 

The present invention describes processes for the expression in yeast, of insulin as a 
10 prepro-polypeptide, said polypeptide consisting of a signal sequence, derived from the 
Schwanniomyces occidentalis glucoamylase or Carcinus maenas crustacean 
hyperglycemic harraone signal-leader sequence, and present at N-terminus of an insulin 
polypeptide of the formula: 
B(l-29)-A(l-21) 

15 where B(l-29) and A(l-21) refer to the human insulin B chain from amino acid 1 to 
amino acid 29 and the human insulin A chain from amino acid 1 to amino acid 21 
respectively. 

The said process consists of cloning a gene encoding said prepro-polypeptide into a yeast 
expression system under the control of a yeast alcohol inducible promoter, culturing the 
20 yeast in an appropriate culture medium, isolating the said polypeptide from the culture 
medium, and processing the same to get rid of the signal peptide region and obtain the 
final native form of the human insulin protem. 
DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a composite system for the expression and secretion of 
25 insulin, particularly human insului, in yeast. It consists of expressing insulin as a 
'*prepro"-polypeptide, consisting of two distinct entities - an "insulin region" (the "pro" 
region) and a "signal peptide region" (the "pre" region). The pro-polypeptide region has 
the formula: B(l-29)-A(l-21), where B(l-29) is the B chain polypeptide of insulin, 
preferably human insulin, from amino acid 1 to amino acid 29 and A(l-21) is the A chain 
30 polypeptide of insulin, preferably human insulin, from amino acid 1 to amino acid 21. 
The amino acid 29 of the B chain is coimected directly to the amino acid 1 of the A chain 
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by means of a peptide bond. The said pro-polypeptide B(l-29)-A(l"21) may be converted 
into the "native" insulin - B(1-30):::A(1-21) (where the B and the A chain are no longer 
connected by a peptide bond and instead have 2 interchain and 1 intrachain disulfide 
bonds) by means of a '*transpeptidation" reaction with Threonine-butylester-butylether, in 

5 the presence of the proteolytic enzyme trypsin, followed by hydrolysis (Refer US patents 
4343898 or 4489159). The second entity of the prepro-polypeptide - the signal peptide 
region - is the region that directs the polypeptide into the yeast secretory pathway. This 
region is N-terminus to the insulin polypeptide region and connected to the amino acid 1 
of the B chain by means of a peptide bond. The signal peptide may be derived either from 

10 Schwanniomyces occidentalis glucoamylase signal peptide sequence or Carcinus maenas 
crustacean hyperglycemic harmone signal peptide sequence. In one embodiment of the 
present invention the signal peptide region carries the Kex protease site, that could 
interact with the Kex protease present in the secretory pathway of the yeast expression 
host. Such an interaction would result in the cleavage of the signal peptide region during 

15 the secretion of the heterologous polypeptide. Hence, in this case the polypeptide is 
secreted into the culture medium only as the pro-polypeptide viz. B(l-29)-A(l-21). This 
may then be isolated and converted to the native form (B(1-30):::A(1-21)) by the said 
transpeptidation and hydrolysis reactions (depicted in Figure 1). In a second embodiment 
of the present invention, the signal peptide region does not contain the kex protease site. 

20 In this case the polypeptide secreted into the culture medium is the prepro-polypeptide 
viz. SP-B(l-29)-A(l-21), where SP is the signal peptide region that remains attached to 
the amino acid 1 of the B chain by means of the peptide bond. This second embodiment 
would hence require the in vitro removel of the signal peptide region as well as 
conversion of the B(l-29)-A(l-21) into the '^native" form - B(1-30):::A(1-21). Hence in a 

25 fiirthCT aspect of the second embodiment, the prepro form carries either one basic amino 
acid residue (arginine or lysine) or methionine immediately adjecent and N-terminus to 
the B(l-29)-A(l-21) region. The said basic amino acid residue or methionine residue aid 
the removel the signal peptide region from the B(l-29)-A(l-21) region by means of a 
chemical reaction with either trypsin or cyanogen bromide respectively. Of the two 

30 general embodiment ^escribed above, the second embodiment is preferred over the first 
because, while the the second embodiment does require the additional reaction to remove 
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the signal peptide region, we observe that the yields of the polypeptide obtained by 
following the first embodimmt are much lower then those obtained firom the second 
embodiment. This may, in part, be due to the increased intracellular retention of the 
heterologous protein in the first embodiment. This increased retention may be a result of 

5 the increased interactions with the Kex protease in the secretory pathway, and a 
consequent reduced levels of protein secreted into the culture medium. On the other hand, 
since the heterologous polypeptides (the prepro-polypeptides) of the second embodiment 
do not carry the Kex protease site, there may be reduced interactions betwee]:L the 
polypeptide and the intracellular protease, and a consequent increased levels of secreted 

10 polypeptide. Furthermore, in the case of the second embodiment, between the use of 
either the basic amino acid residue or methionine, we prefer the use of the basic amino 
acid (cleavable with trypsin), because then the secreted form viz. - SP-B(l-29)-A(l-21) 
may be converted directly into the "native" form B(1-30):::A(1-21) by the same 
transpeptidation reaction required for the conversion of B(l-29)-A(l-21) to the native 

15 form - B(1-30):::A(1-21). Thus a single trypsin-transpeptidation reaction would remove 
the signal peptide (SP) region, as well as convert the pro form [(B(l-29)-A(l-21)] into 
the native form [(B(1-30):::A(1-21)] (as dqpicted in Figure 2). Seq ED 1 and 3 are 
examples of the polypeptides representing the first embodiment (viz. with Kex site) and 
Seq ID 2 and 4 are examples of the polypeptides representing the second embodiment 

20 (viz. without kex site). In seq ID 1 and 2 the signal peptide region is derived fi:om 
Schwanniomyces occidentalis glucoamylase signal peptide sequence and in Seq ID 3 and 
4 the signal peptide region is derived from Carcinus maenas crustacean hyperglycemic 
harmone signal peptide sequence. Seq ID S, 6, 7, 8 are examples of DNA sequences 
encoding the polypeptides represented in Seq ID 1, 2, 3, 4 respectively. 

25 The DNA sequences encoding the prepro-polypeptides described above were 

cloned into a yeast expression vector under the control of alcohol inducible promoters. 
Examples of such promoters include the promoters native to the yeast methanol oxidase 
(MOX), formaldehyde dehydrogenase (FMDH), formate dehydrogenase (FMD) and 
dihydroxyacetone synthetase (DHAS) genes. The recombinant expression vectors, 

30 cairying the DNA sequences of the prepro-polypeptides under the control of the alcohol 
inducible promoters, were then transformed into appropriate yeast host strains. Examples 
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of such host strains include genera of Hansenula, Saccharomyces, Pichia, 
Kluyveromyces. The transformed yeast were then cultured in an ^propriate culture 
medium, the polypeptides were isolated firom the medium and then converted into the 
native form. 

5 The present invention thus provides a composite expression system for the very 

high expression of human insulin. The expression system consists of an alcohol inducible 
promoter and the DNA sequence of a "prepro"-polypeptide. The prepro-polypeptide in 
turn consists of the DNA sequence encoding the insulin polypeptide region [B(l-29)- 
A(l-21)] and the DNA sequence encoding either the Schwanniomyces occidentalis 

10 glucoamylase signal peptide sequence or the Carcinus maenas crustacean hyperglycemic 
harmone signal peptide sequence. The prepro-polypeptide may or may not carry the 
sequence recognized by the Kex protease site between the signal peptide region and the 
insulin polypeptide region. If the Kex protease site is absent, then either one basic amino 
acid residue (lysine or arginine) or one methionine residue is present between the signal 

15 peptide region and the insulin polypeptide region. In either case the expressed 
polypeptide is secreted into the intracellular medium, conveniently isolated and further 
processed to obtam the native insuUn. The processing mechanisms are depicted in 
Figures 1 and 2. 

The examples that follow, figures and Seq n)s merely illustrate the invention in 
20 greater detail, but in no way restrict the scope of the same. 
Example 1 

Construction of the recombinant vector carrying the prepro-polypeptides. 

Seq ID 1, 2, 3 and 4 correspond to the amino acid sequences of the prepro-polypeptides 
InGa, InGa-, InCai, InCh-. In the case of Seq ID 1 and 2, the peptide region fi:om amino 

25 acid 1 to 78 is the signal peptide region that ensures the secretion of the heterologous 
proteins. On the other hand the peptide region 79-107 of Seq ID 1 and 2 corresponds to 
amino acids 1-29 of the human insulin B chain, while the peptide region 108 to 128 of 
Seq ID 1 and 2 corresponds to amino acids 1-21 of the human insulin A chain. Similarly, 
in the case of Seq ID 3 and 4, the peptide region fi'om amino acids 1-66 corresponds to 

30 the signal peptide regions, whereas the peptide region 67-116 corresponds to the insulin 
B and A chain regions as above. The signal peptide regions of Seq ID 1 and 2 are derived 
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from Schwanniomyces occidentalis glucoamylase signal peptide sequence, with Seq ID 1 
possessing the kex site, whereas Seq ID 2 not possessing the same. On the other hand, the 
signal peptide regions of Seq ID 3 and 4 are derived JGrom the Carcinus maenas 
crustacean hyperglycemic harmone signal sequence, with Seq ID 3 possessing the kex 

5 site, whereas Seq ID 4 not possessmg &e same. The Seq ED S, 6, 7, 8 correspond to the 
oligonucleotides that encode said prepro-polypeptides InGa, InGa-, InCh, InCh- (defined 
by Seq ID 1, 2, 3, 4). These oligonucleotides were chemically synthesized and designed 
to have those codons that are most optimally expressed in the yeast Hansenula 
polymorpha. The oligonucleotides were cloned into the EcoRI and BamHl restriction 

10 enzyme sites of the plasmid expression vector pMPT121 (Figure 3) by carrying out 
restriction enzyme digestion and ligation reactions by methods well known to those of 
ordinary skill in the art ("Molecular Cloning: A Laboratory Manual" by J. Sambrook, 
E.F. Fritsch and T. Maniatis, n edition, Cold Spring Harbour Laboratory Press, 1989). 
The pMPT121 plasmid e:q)ression vector is based on a pBR322 plasmid and contains the 

15 following elements: 

- standard E, coli pBR322 skeleton including E, coli origin of replication (ori). 

- ampicillin resistance gene for selection of transformed E, colL 

- auxotrophic selective marker gene complementing the auxotrophic deficiency of the 
host - Hansenula polyniorpha, (K polymorpha) (URA3 gene). 

20 - K polymorpha Autonomously R^licating Sequence (HARS). 

- an expression cassette containing the MOX promoter and the MOX terminator for 
insertion of the gene construct and controlling the expression of the cloned 
heterlogous polypeptides in the said yeast strain. 

The individual ligation reactions were then transformed into E.coli hosts by methods 
25 well known to those skilled in the art ("Molecular Cloning: A Laboratory Manual" by J. 
Sambrook, E.F. Fritsch and T. Maniatis, II edition. Cold Spring Harbour Laboratory 
Press, 1989). Various E.coli clones carrying the recombinant plasmids were cultured and 
the plasmids isolated by methods well known in the art ("Molecular Cloning: A 
Laboratory Manual" by J. Sambrook, E.F. Fritsch and T. Maniatis, n edition, Cold Spring 
30 Harbour Laboratory Press, 1989). The isolated recombinant plasmids were then 
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confirmed to be carrying the above oligonucleotides, encoding the respective prepro- 
polypeptides, by DNA sequencing. 
Example 2 

Transformation of a yeast strain with the recombinant vectors carrying the insulin 
5 precursor sequences. 

The recombinant expression plasmids each carrying the oligonucleotides encoding the 
prepro-polypeptides InGa, InGa-, InCh, InCh-, were then transformed into the yeast 
strain K polymorpha that is an ura3 auxotrophic mutant deficient in orotidine-5'- 
phosphate decarboxylase by methods known in the art (Hansenula polymorpha: Biology 
10 and Applications, Ed. G. Gellissen. Wiley- VCH, 2002). The resulting recombinant clones 
were then fiirther used for the expression of the said polypeptides. 
Example 3 

Expression of the insulin precursors in yeast 

The yeast transformants thus obtained were then used for the expression of the insulin 
15 prepro-polypeptides InGa, InGa-, InCh, InCh-. The expression conditions were: 

a) Preculture: Single clones, each carrying the expression vector carrying the 
oUgonucleotide sequences encoding the prepro-polypeptides InGa, InGa-, InCh, 
InCh-, were inoculated into 100 ml of autoclaved 2X YNB/1.5% glycerol medium in 
a 500 ml shake flask with baffles. The composition of the 2X YNB/1.5% is 0.28 g 

20 yeast nitrogen base, 1.0 g ammonium sulfate, 1.5 g glycerol and 100 ml water. The 
cultures were incubated for about 24 h at 37^C with 140 rpm shaking until an O.D600 
of 3-5 is reached. The final pH after incubation is around 2.9-3. 

b) Culture: 2X 450 ml of autoclaved SYN6/1.5% glycerol media in 2X 2000 shake 
flasks with baffles were inoculated with 20-50 ml of each of the above preculture. 

25 The cultures were then incubated for 48 h at 30°C and 140 rpm. The composition of 
the SYN6/1.5% glycerol medium is NH4H2PO4 - 13.3.g, MgS04 x 7H2O - 3.0 g, KCl 
- 3.3 g, NaCl - 0.3 g, glycerol - 15.0 g, water 1000 liters. In addition the following 
solutions (fiher sterilized) were added to the autoclaved media: CaCb solution - 6.7 
ml, microelement solution - 6.7 ml, vitamin solution - 6.7 ml, trace element solution 

30 -3.3 ml. 
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Example 4 

Isolation and estimation of insulin polypeptide precursors. 

1.5 ml of the supematants from the cultures expressing the secreted prepro-polypeptides, 
InGa, InGa-, InCh, InCh- were isolated by centrifiigation and quantified on an analytical 

5 RP-HPLC column (Nucleosil CI 8, 5(im, 2mm x 5Qmm). The buffers employed for 
analysis were: Buffer A: 10% Acetonitrile, 0.1% trifluoroacetic acid in water and Buffer 
B: 80% acetonitrile, 0.1% trifluoroacetic acid in water. The yields of each are expressed 
in Table 1, as total insulin components normalized with dry cell weight, and expressed as 
% yield, with the yields of precursors having the processing site (InGa, InCh) taken as 

10 100% . 



Table 1 



Prepro-polypq)tides 


% Yield 


InGa 


100 


InGa- 


135 


InCh 


100 


InCh- 


127 


Thus the yield of the InGa- and InCh- (v/bic 


1 do not have the Kex site) are, respectively. 



35% and 27% higher then those of InGa and InCh. 
Example 5 

15 Isolation, purification and conversion of the prepro-polypeptides to ^'native" insulin. 
Ceil clarification. 

Culture supematants from example 3 were pooled and clarified by centrifiigation. The 
prepro-polypeptides were then isolated from this diluted supernatant by Cation exchange 
chromatography. 
20 Cation exchange chromatography. 

A Chromatography column of 26mm x 50mm dimensions was packed with 25ml cation 
exchange SP- Sepharose fast flow (Pharmacia) resin and equiUbrated with 20niM citrate 
buffer at pH 4.0. The diluted supematents were applied to the cation exchange column at 
pH 4.0 and a flow rate of 200cm/h. The columns were then washed with 20mM citrate 
25 buffer (5 Column Volumes) at 200cm/h. The bound prepro-polypeptides were eluated 
with a buffer containing lOQmM tris HCl at pH 7.5, at a flow rate of lOOcm/h. About 
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306 mg of prepro-polypeptides were obtained when about 348 mg of prepro-polypeptides 
was applied to the column. 
Isoelectric precipitation. 

300 mg of zinc chloride was added to 306 mg of prepro-polypeptides obtained jfrom the 
5 above described cation exchange chromatogr^hy. The pH was adjusted to 6,0 with HCl 
to precipitate the prepro-polypeptides firom the pool. The reactions were kept at 8^C for 
12 hours followed by centrifiigation and then drying. 
Transpeptidation. 

About 300 mg of precipitated prepro-polypeptides from above were then dissolved and 
10 incubated at 12°C, in a reaction mixture containing 2.36ml of Dimethyl 
sulfoxide/Methanol (50/50 v/v), 1.5 g of L-Threonine-t-butylester-t-butyl ether, 1.44 ml 
milliQ water and 30ul of acetic acid. The reactions were chilled for 5 min in ice. 15.mg 
of trypsin (from bovine pancreas dissolved in 0.255 ml of 50 mM Calcium acetate and 
0.05% acetic acid) was added, pH adjusted to 7.3 and the reaction mixture was incubated 
15 at 12 °C for about 3 hours. The reactions were quenched by reducing the pH to 3.0 with 
IN HCl. This reaction results in the conversion of the prepro-polypeptides to insulin-t- 
butylester-t-butyl ether (refer Figures 1 and 2). 
Purification of Insulin -t-butylester - t-butyl ether. 

From the above reaction mixtures, about 234 mg of the t-butyl ester-t-butyl eth^ 
20 derivatives were diluted 10 fold with 10% 2-propanol containing 0.01% TFA and then 
^pUed to a chromatography column of 20mm x SOmm dimensions and packed with 
25ml reverse phase Amberchrome CG-300 SD resin. The column had been pre- 
equilibrated with buffer A (composition below) and the reaction mixtures ^plied at' a 
flow rate of 100 cm/h. The column is equipped with a binary gradient solvent delivery 
25 system and an online ultraviolet detector. The buffers used were, Buffer A: 10% v/v 2- 
propanol, 0.1% trifluoro acetic acid (TFA) and Buffer B: 80% v/v 2-propanol, 0.1% 
trifluoroacetic acid (TFA). After loading, the column was washed with 5 Column 
Volumes of 20% buffer B at a flow rate of lOOcm/h. The insulm-t-butyl ester-t-butyl 
ether derivatives were eluted with a linear gradient of 20% to 50% buffer B in 7.5 column 
30 volumes at a flow rate of lOOcm/h. The fractions containing pure insulin-ester-ethers 
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were pooled, 2-propanol was removed under reduced pressure and the aqueous phase 

lyophilized to obtain dry insulin-t-butyl ester-t-butyl ether. 

Hydrolysis. 

About 180 mg of lyophilized insulin-t-butyl-ester-t-butyl-ether was hydrolyzed to 

5 "native" insulin in a 100ml round bottom flask by dissolving it in anhydrous 
trifluoroacetic acid at a concentration of 10 mg insulin derivative per ml TFA, in 
presence of 0.5mg tryptophan per ml of TFA. The reaction mixtures were kept at 25 
for 20 min. TFA was removed from the reaction mixture under reduced pressure in a 
Buchi rota evaporator and resuspended the residue mass in 20 ml 1% acetic acid (v/v). 

10 Final HPLC purification. 

About 170 mg insulin obtained from the hydrolysis reaction (described above) was 
filtered to remove particulate matter and applied to a C 18, 10 mm x 250 mm, Vydac 
reverse phase HPLC column equipped with a binary gradient pump and an online 
ultraviolet detector at 280nm. The buffers used were: Buffer A containing 0.2M sodium 

15 sulfate, 20% Acetonitrile and 0.01% TFA, and Buffer B - mixture of 50% Acetonitrile, 
50%water and 0.01% TFA. After loading, the column was washed with 1 Column 
Volume of 20% buffer B at a flow rate of 4ml/min. Insulin was isolated achieved with 
gradient elution that followed washing. During elution the concentration of buffer B 
increased from 20% to 40% over a period of 300 min at a flow rate of 4 ml/min. 

20 Isoelectric precipitation. 

40 mg of zinc chloride was added to a pooled fraction of insulin containing 141 mg of 
insulin (the pools obtained from the above chromatographic process). The pH was raised 
to 6.0 with sodium hydroxide in order to precipitate insulin from the pool as zinc insulin. 
The precipitate was kept at 8 for 12 hours followed by centrifiigation and then dried to 

25 isolate zinc insulin. 

Figure 1: Schematic presentation of the secretion and processing of the insulin pre-pro 
polypeptide possessing the KEX site in the signal sequence region 
Figure 2: Schematic presentation of the secretion and processing of insulin pre-pro 
polypeptide not having the KEX site in the signal sequence region. In this example, there 

30 is a single basic amino acid residue (Arg) just adjacent to the insulin polypeptide region 
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Figure 3: Describes the expression vector (the '"Vector Mq)") used for the expression 
and secretion of heterlogous proteins using the present invention. MOX-promoter refers 
to the alcohol inducible promoter methanol oxidase promoter, MOX-T refers to the 
methanol oxidase terminator. Amp refers to the amplicillin resistance conferring gene and 
5 URA3 is the yeast auxotropic selection marker. The vector map includes the locations of 
the various restriction endonuclease sites of the vector. 
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